Usable Result

SAM2 Image Segmentation on Video Stream

The SAM2 image segmentation model was applied to a video stream to generate segments for each frame. The following steps were taken to ensure accurate segmentation:

Segment Generation:
- The image segmentation function was used to generate segments for each frame of the video.
Segment Sorting:
- After generating segments, each segment was sorted in negative space to highlight the largest one, ensuring that the subject of interest remained prominent.
Final Output:
- The largest segment was then adjusted so that the subject appeared in white, and the background was rendered in black.

As a result, the segmentation process successfully isolated the subject from the background, providing a clear distinction between them.

Using the same to prompt more than one subjects:

Using the same to prompt multiple subjects:

Point to note:

Issues with Applying SAM Image Masking on Video Frames

Separating the video into frames and applying the SAM image masking on each frame individually encountered the following challenges:

Loss of Prompting Ability:
- The process fails to track all masks (objects) across frames effectively. This leads to difficulties in maintaining consistent tracking throughout the video.
High Computational Power Requirements:
- The computational demand for processing each frame is significant. During testing, it was observed that the system could process only 4 frames in approximately 35 minutes before the computer crashed due to resource exhaustion.

SAM2 Image Segmentation on Video Stream​

Using the same to prompt more than one subjects:​

Using the same to prompt multiple subjects:​

Point to note:​

Issues with Applying SAM Image Masking on Video Frames​

SAM2 Image Segmentation on Video Stream

Using the same to prompt more than one subjects:

Using the same to prompt multiple subjects:

Point to note:

Issues with Applying SAM Image Masking on Video Frames